To analyze the evolutionary emergence of structural complexity in physical processes we introduce
a general, but tractable, model of objects that interact to produce new objects. Since the objects,
ε-machines, have well defined structural properties, we demonstrate that complexity in the resulting
population dynamical system emerges on several distinct organizational scales during evolution,
from individuals to nested levels of mutually self-sustaining interaction. The evolution to increased
organization is dominated by the spontaneous creation of structural hierarchies and this, in turn, is
facilitated by the innovation and maintenance of relatively low-complexity, but general individuals.

Long before the distinction between genetic information and functional molecules (molecules
encoded by, but manipulating, that stored information), there were objects that simply interacted
and mutually transformed each other. How did structured objects with the dual roles of information
storage and transformation emerge in such initially disorganized environments? Here we introduce
a class of models that allows us to explore this question, in a setting often referred to as
pre-biotic evolution [1]. In contrast with prior work, we focus on the questions of what levels of
structure and information processing can emerge and, specifically, what population-dynamical
mechanisms drive the transition from pre-biotic to biotic organization.

One of the key puzzles in this is to understand how systems, on the one hand, produce structure
that, on the other, becomes substrate for future functioning and innovation. The spirit of our
approach to this puzzle follows that suggested by Schrödinger [2] and found in von Neumann's
random self-assembly model [3]. However, our model is more physical than chemical in the sense
that we do not assume the existence of sophisticated chemical entities, such as macromolecules,
nor do we even use chemical metaphors, such as information being stored in one-dimensional
arrays (in aperiodic crystals, as Schrödinger presciently proposed). While ultimately interested
in pre-biotic organization and its emergence, our focus is on what one might call pre-chemical
evolution. As such, the following provides a first step to directly address how structural
complexity and evolutionary population dynamics interact [4].

Here we introduce a model of the emergence of orga- 
nization and investigate it in a setting that, at one and 
the same time, provides a well defined and quantitative 
notion of structure and is mathematically tractable. The 
goal is to develop predictive theories of the population 
dynamics of interacting, structured individuals and their
collective organization. With a well defined measure of
structural complexity one can precisely state the question 
of whether or not complexity has genuinely emerged over 
time in pre-biotic and pre-chemical processes. Addition- 
ally, with a predictive theory of the population dynamics 
one can identify and analyze the (at some point evolu- 
tionary) mechanisms that lead to such increases (and to 
decreases) in structural complexity. 

We create a well stirred population, the finitary process soup, of initially random, finite
ε-machines [5] that interact and transform each other, making new ε-machines of differing
structure and so of differing transformational properties. The initial random soup serves as
a reference that, for reasons to become apparent below, has "null" structural complexity, both
in the individuals (on average) and across the population. Here we consider the case of a
process gas: objects, ε-machines $T_A$ and $T_B$, are successively randomly paired (pan-mixia)
and act on each other to create progeny: $T_B + T_A \to T_C$. No externally applied selection
or variation is imposed.

An ε-machine $T = \{\mathcal{S}, \mathcal{T}\}$ consists of a set of causal states $\mathcal{S}$
and transitions $\mathcal{T}$ between them: $T_{ij}^{(s)}$, $s \in \mathcal{A}$. We interpret
the symbols labeling the transitions in the alphabet $\mathcal{A}$ as consisting of two parts:
an input symbol that determines which transition to take from a state and an output symbol
which is emitted on taking that transition. ε-machines have several key properties [5]: All of
their recurrent states form a single strongly connected component. Their transitions are
deterministic in the specific sense that a causal state together with the next input symbol
determines the successor state. And $\mathcal{S}$ is minimal: an ε-machine is the smallest
causal representation of the transformation it implements.

Due to these properties, one can quantify an ε-machine's structural complexity. To do this we
need the probability distribution over the states in $\mathcal{S}$, that is, how often they are
visited. This is given by the normalized left eigenvector associated with eigenvalue 1 of the
stochastic matrix $T = \sum_{s \in \mathcal{A}} T^{(s)}$. Denote this eigenvector, normalized in
probability, by $p_{\mathcal{S}}$. An ε-machine's structural complexity is the amount of stored
information: $C_\mu(T) = -\sum_{v \in \mathcal{S}} p_{\mathcal{S}}^{(v)} \log_2 p_{\mathcal{S}}^{(v)}$.
When $C_\mu$ is finite, we say the ε-machine (or, more properly, the transformation it
describes) is finitary.
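
As an illustration of the definition above, the following minimal sketch computes the state distribution and the structural complexity for a small machine. The two-state example, its transition probabilities, and the helper names (`stationary_distribution`, `structural_complexity`) are assumptions made for illustration and do not come from the text. Power iteration stands in for a general eigenvector solver and presumes an aperiodic, strongly connected machine.

```python
from math import log2

def stationary_distribution(T, iterations=10_000, tol=1e-12):
    """Left eigenvector (eigenvalue 1) of a row-stochastic matrix T.

    Uses power iteration, which assumes the chain is aperiodic and
    strongly connected; this is an illustrative choice, not the only one.
    """
    n = len(T)
    p = [1.0 / n] * n
    for _ in range(iterations):
        q = [sum(p[i] * T[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(p, q)) < tol:
            return q
        p = q
    return p

def structural_complexity(labeled):
    """C_mu in bits, given symbol-labeled transition matrices {s: T^(s)}."""
    mats = list(labeled.values())
    n = len(mats[0])
    # T = sum over symbols s of T^(s)
    T = [[sum(m[i][j] for m in mats) for j in range(n)] for i in range(n)]
    p = stationary_distribution(T)
    return -sum(x * log2(x) for x in p if x > 0)

# Hypothetical two-state machine: state 0 loops on "1" or moves to
# state 1 on "0"; state 1 always returns to state 0 on "1".
two_state = {
    "1": [[0.5, 0.0], [1.0, 0.0]],
    "0": [[0.0, 0.5], [0.0, 0.0]],
}
# Any single-state machine has a one-point state distribution.
one_state = {"0": [[0.5]], "1": [[0.5]]}

c_two = structural_complexity(two_state)
c_one = structural_complexity(one_state)
```

For the two-state example the state probabilities are 2/3 and 1/3, so `c_two` is the entropy of that distribution, while `c_one` vanishes because a single state stores no information.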

Thus, unlike previous models, such as λ-expressions [6], machine instruction codes [7], tags [8],
and cellular automata [9], ε-machines allow one to readily measure the structural complexity and
disorder of the transformations they specify. It is well known that algorithms do not even exist
to measure these quantities for machine-language programs and λ-expressions, for example, since
these are computation-universal models [10]. As our results demonstrate, these tractable aspects
of ε-machines give important quantitative, interpretive, and theoretical advantages over prior
work on the pre-biotic evolutionary emergence of structural complexity. The finitary assumption
is also consistent with the recent proposal that gene expression is implemented with
finite-memory computational steps [11].

We should emphasize that, in these finitary process soups and in contrast with prior work,
ε-machines do not have two distinct modes of representation or functioning. The objects are
only functions, in the prosaic mathematical sense. Thus, one benefit of this model of
pre-biotic evolution is that there is no assumed distinction between
gene and protein |3, y| , between data and program |l3| . 

Finitary process soups allow one to quantitatively an- 
alyze not only the structural complexity of individuals, 
but also the interaction between individual structure and 
population organization and dynamics in terms of how 
they store and process information and the causal archi- 
tecture that supports these. Since this view of a system 
intrinsically computing applies both to individuals and 
to the population as a whole, we can identify the locus 
of a population's structural complexity. Is it largely the 
sum of the individuals' or largely embodied in the trans- 
formative relationships between individuals? Perhaps it 
derives from some irreducible interaction between these 
levels. 

The finitary process soup differs from early investiga- 
tions in which finite-state machines were evolved using 
an explicit mutational operator jl^. Here, novelty de- 
rives directly from how the objects themselves are struc- 
tured, since this determines how they transform each 
other. Equally important, survivability is determined by 
an individual's rate of reproduction, the original biological
notion of fitness; there is no externally imposed fitness
function. In this, the process soup is similar to the
molecular evolution models of Eigen and Schuster [l3| . 

A population $\mathcal{P}$ is a set of $N$ individuals, each of which is an ε-machine. More
compactly, one can also describe the population as a distribution of ε-machine types:
$\mathbf{f} = (a_1/N, a_2/N, \ldots, a_n/N)$, where $n$ is the number of possible ε-machine
types and $a_i$ is the number of individuals of type $T_i$. A single replication is determined
through compositions and replacements in a two-step sequence: First, construct ε-machine
$T_C$ by:

1. With probability $1 - \Phi_{in}$, forming the composition
$T_C = T_B \circ T_A$ from $T_A$ and $T_B$ randomly selected
from the population and minimizing [10].

2. With probability $\Phi_{in}$, generating a random $T_C$.

Second, replace a randomly selected ε-machine, $T_D$, with $T_C$. Here $\Phi_{in}$ is the rate
of influx of new (random) ε-machines. When $\Phi_{in} = 0$, the soup is a closed system. When
$\Phi_{in} = 1$, the soup is open, but consists of entirely random ε-machines and so is
unstructured. The initial population $\mathcal{P}_0$, with $\mathbf{f} = (1/n, \ldots, 1/n)$,
is similarly unstructured.
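
The two-step replication can be sketched as follows. This is only an illustration under stated assumptions: machines are restricted to single-state transducers over a binary alphabet, each represented as a set of allowed (input, output) symbol pairs, so that composition is relational composition and no minimization step is needed. The names `compose`, `random_machine`, and `replicate` are hypothetical, parents are drawn with replacement, and a composition that yields the null (empty) machine is treated as producing no offspring; none of these choices is specified in the text.

```python
import random

def compose(t_b, t_a):
    """T_B o T_A: T_A acts first, and its output symbol is fed to T_B."""
    return frozenset((x, z) for (x, y) in t_a for (y2, z) in t_b if y == y2)

def random_machine(rng):
    """Draw a non-null single-state machine uniformly at random."""
    pairs = [(x, y) for x in "01" for y in "01"]
    while True:
        machine = frozenset(p for p in pairs if rng.random() < 0.5)
        if machine:  # reject the null machine
            return machine

def replicate(population, phi_in, rng):
    """Perform one replication on the population, in place."""
    if rng.random() < phi_in:
        # Influx: a new random machine enters the soup.
        t_c = random_machine(rng)
    else:
        # Composition of two randomly selected parents.
        t_a, t_b = rng.choice(population), rng.choice(population)
        t_c = compose(t_b, t_a)
        if not t_c:
            return  # assumption: a null product replaces nothing
    # Replace a randomly selected individual T_D with T_C.
    population[rng.randrange(len(population))] = t_c

rng = random.Random(0)
soup = [random_machine(rng) for _ in range(100)]
for _ in range(1000):
    replicate(soup, phi_in=0.1, rng=rng)
```

The population size stays fixed because every successful replication overwrites one existing individual, which is what makes reproduction rate act as fitness here.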

As a first step to detect population structure we define the interaction network $\mathcal{G}$
as the ε-machine compositions that have occurred in the population. For a population with $n$
different types, $\mathcal{G}$ is described by an $n \times n$ matrix the entries of which are
the machine types returned by the compositions $T_i \circ T_j$, $T_i, T_j \in \mathcal{P}$. We
represent $\mathcal{G}$ as a graph whose nodes are the machine types in the population and
whose directed edges connect one node, say $T_A$, to another, $T_C$, when $T_C = T_B \circ T_A$.
The edges are labeled with the transforming machine $T_B$. We also represent $\mathcal{G}$ as a
transition matrix $\mathcal{G}_{ij}^{(k)} = P(T_k | T_i, T_j)$, when $T_k = T_i \circ T_j$.

The second step is to introduce a natural notion of organization that encompasses interaction
and dynamic stability. We define a meta-machine as a set of ε-machines that is both closed and
self-maintained under composition. That is, $\Omega \subseteq \mathcal{P}$ is a meta-machine if
and only if (i) $T_i \circ T_j \in \Omega$ for all $T_i, T_j \in \Omega$ and (ii) for all
$T_k \in \Omega$, there exist $T_i, T_j \in \Omega$ such that $T_k = T_i \circ T_j$. This
definition of self-maintenance captures Maturana et al's 
autopoiesis [lj|, Eigen and Schuster's hypercycles |l.ll |. 
and autocatalytic sets |6l llSj. In a process soup, awash 
in fluctuations and change, a meta-machine is a type of
organization that can be regarded as an autonomous and
self-replicating entity. Note that, in this sense, the initial
random soup $\mathcal{P}_0$ is not organized. To the extent that
interaction networks persist, they are meta-machines.
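
The two conditions defining a meta-machine can be checked directly on a finite set. The sketch below assumes, purely for illustration, single-state machines over a binary alphabet represented as sets of (input, output) pairs; `compose` and `is_meta_machine` are hypothetical names. Closure says every pairwise composition stays in the set, and self-maintenance says every member is itself such a composition, so together they amount to the set of products equalling the set.

```python
from itertools import product

def compose(t_b, t_a):
    """T_B o T_A: T_A acts first, and its output symbol is fed to T_B."""
    return frozenset((x, z) for (x, y) in t_a for (y2, z) in t_b if y == y2)

def is_meta_machine(machines):
    """True if the set is closed and self-maintained under composition."""
    omega = set(machines)
    products = {compose(t_i, t_j) for t_i, t_j in product(omega, repeat=2)}
    closed = products <= omega            # condition (i)
    self_maintained = omega <= products   # condition (ii)
    return bool(omega) and closed and self_maintained

identity = frozenset({("0", "0"), ("1", "1")})
flip = frozenset({("0", "1"), ("1", "0")})

pair_is_meta = is_meta_machine({identity, flip})
flip_alone_is_meta = is_meta_machine({flip})
```

Here the identity and the bit-flip together form a meta-machine, whereas the bit-flip alone does not, since composing it with itself yields the identity, which lies outside the set.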

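To measure the diversity of interactions in a population we define the interaction network
complexity

$$C_\mu(\mathcal{G}) = -\sum_{p_i, p_j, p_k > 0} v_{ij}^k \log_2 v_{ij}^k ,$$

where

$$v_{ij}^k = \begin{cases} p_i p_j / Z, & T_k = T_i \circ T_j \text{ has occurred}, \\ 0, & \text{otherwise}, \end{cases} \qquad (1)$$

and $Z$ stands for the normalizing sum in the denominator.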
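
A minimal numerical sketch of Eq. (1) follows. It rests on an assumption that should be read as such: the normalization is taken to be the sum of $p_i p_j$ over all occurred compositions among types with nonzero frequency, so that the weights sum to one. The function name `network_complexity`, the dictionary of frequencies, and the set of `(i, j, k)` triples are hypothetical conveniences, not notation from the text.

```python
from math import log2

def network_complexity(p, occurred):
    """Entropy (bits) of normalized weights p_i * p_j over occurred triples.

    p        : dict mapping type label -> frequency
    occurred : set of (i, j, k) meaning T_k = T_i o T_j has occurred
    Assumption: weights are normalized to sum to one over these triples.
    """
    weights = [
        p.get(i, 0.0) * p.get(j, 0.0)
        for (i, j, k) in occurred
        if p.get(i, 0.0) > 0 and p.get(j, 0.0) > 0 and p.get(k, 0.0) > 0
    ]
    z = sum(weights)
    if z == 0:
        return 0.0
    return -sum((w / z) * log2(w / z) for w in weights)

# Hypothetical two-type population in which all four compositions occurred.
frequencies = {"a": 0.5, "b": 0.5}
triples = {("a", "a", "a"), ("a", "b", "b"), ("b", "a", "b"), ("b", "b", "a")}
c_net = network_complexity(frequencies, triples)
```

With four equally weighted interactions the result is two bits; a population in which only one composition ever occurs would give zero.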



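Finally, a machine type's frequency changes at each generation according to its interactions
and is given by

$$f_t^{(i)} = \frac{\mathbf{f}_{t-1} \cdot \mathcal{G}^{(i)} \cdot \mathbf{f}_{t-1}}{\sum_{k=1}^{n} \mathbf{f}_{t-1} \cdot \mathcal{G}^{(k)} \cdot \mathbf{f}_{t-1}} . \qquad (2)$$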
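
The update in Eq. (2) can be iterated numerically. The sketch below is an illustration under assumptions: machines are single-state transducers over a binary alphabet, represented as sets of (input, output) pairs, and composition is deterministic, so each entry of the transition matrix is either 0 or 1. The names `compose`, `all_single_state_machines`, and `step` are hypothetical. Null products carry no weight, and the denominator of Eq. (2) renormalizes what remains.

```python
from itertools import product

def compose(t_b, t_a):
    """T_B o T_A: T_A acts first, and its output symbol is fed to T_B."""
    return frozenset((x, z) for (x, y) in t_a for (y2, z) in t_b if y == y2)

def all_single_state_machines():
    """All non-null subsets of the four (input, output) pairs."""
    pairs = [(x, y) for x in "01" for y in "01"]
    return [
        frozenset(p for bit, p in enumerate(pairs) if (mask >> bit) & 1)
        for mask in range(1, 16)
    ]

def step(f):
    """One generation: f maps machine -> frequency; returns the new map."""
    raw = {}
    for (t_i, f_i), (t_j, f_j) in product(f.items(), repeat=2):
        t_k = compose(t_i, t_j)
        if t_k:  # skip null products
            # Numerator of Eq. (2): sum of f_i * f_j over pairs yielding T_k.
            raw[t_k] = raw.get(t_k, 0.0) + f_i * f_j
    total = sum(raw.values())  # denominator of Eq. (2)
    return {t: w / total for t, w in raw.items()}

machines = all_single_state_machines()
f = {t: 1.0 / len(machines) for t in machines}
for _ in range(50):
    f = step(f)
```

Types that are never produced by any composition drop out of `f` after the first step, which is the deterministic counterpart of their being overwritten in the stochastic soup.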



Let us now explore a base case: the population dynamics of one simple subset of ε-machines,
those consisting of only a single state. This class is especially instructive since it is
closed under composition: the composition of two single-state machines is itself a
single-state machine. There are 15 single-state ε-machines, excluding the null machine. As a
consequence, there is a finite number of possible interactions and this, in turn, greatly
facilitates an initial analysis. Although a seemingly trivial case, a population of these
machines exhibits nontrivial dynamics and leads to several insights about unrestricted
populations.
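
One way to make the count of 15 concrete is to assume a binary input and output alphabet, so that a single-state machine is fixed by which of the four (input, output) transitions it allows. There are then 16 subsets, one of which is the null machine. The sketch below enumerates them and checks closure under composition; the representation and the names `compose`, `machines`, and `null` are illustrative assumptions.

```python
from itertools import product

SYMBOLS = "01"
PAIRS = [(x, y) for x in SYMBOLS for y in SYMBOLS]

def compose(t_b, t_a):
    """T_B o T_A: T_A acts first, and its output symbol is fed to T_B."""
    return frozenset((x, z) for (x, y) in t_a for (y2, z) in t_b if y == y2)

# Every non-empty subset of the four (input, output) pairs is one machine.
machines = [
    frozenset(p for bit, p in enumerate(PAIRS) if (mask >> bit) & 1)
    for mask in range(1, 2 ** len(PAIRS))
]
null = frozenset()  # the excluded null machine: no allowed transitions

# Compose every ordered pair and classify the result.
products = {
    (t_b, t_a): compose(t_b, t_a) for t_b, t_a in product(machines, repeat=2)
}
known = set(machines)
closed_up_to_null = all(t_c in known or t_c == null for t_c in products.values())
null_count = sum(1 for t_c in products.values() if t_c == null)
```

In this representation a composition is null exactly when no output symbol of the first machine is accepted as input by the second, so `null_count` measures how many of the 225 ordered pairs fail to interact.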





FIG. 1: Population dynamics starting with N = 10^ randomly sampled single-state ε-machines:
Fraction $f_t$ of ε-machine types as a function of time $t$ (number of replications).
Simulations (dashed lines) and theory Eq. (2) (solid lines).



The first obvious evolutionary pressure driving the system is that governed by trivial
self-reproduction (copying). ε-machines with the ability to copy themselves (directly or
indirectly) are favored, possibly at other ε-machines' expense or in symbiosis with other
ε-machines. The number of such self-reproducing machines grows in relation to the whole
population, further increasing the probability of self-reproduction. Interaction networks
that sustain this will emerge, consisting of cycles of cooperatively reproducing ε-machines.
These are chains of composed mappings that form closed loops.

Figure 1 shows the dynamics of a population sampled from $\mathbf{f}_0 = (1/15, \ldots, 1/15)$
in a closed system ($\Phi_{in} = 0$) of N = 10^ ε-machines. The figure shows that Eq. (2)
predicts the simulations quite well. Out of the interactions between all possible ε-machines
the population settles down to a steady-state interaction network of nine ε-machines.
Figure 2 shows this meta-machine. Note that the structural complexity of individual
ε-machines is always zero: $C_\mu(T) = 0$ for single-state machines. Thus, the population's
structural complexity is due solely to that coming from the network of interactions.

FIG. 2: Organization of the steady-state population, a meta-machine: The interaction graph
$\mathcal{G}$ after 10^ replications, at the end of Fig. 1. The graph shows only the
interactions denoted $T_A \to T_C$. $C_\mu(\mathcal{G}) \approx 5.75$ bits.

With ways to predict the population dynamics and to detect the emergence of structural
complexity in the soup, we now turn to the evolution of unrestricted populations. We summarize
the results using the population-averaged ε-machine complexity $\langle C_\mu(T) \rangle$ and
the run-averaged interaction network complexity $\langle C_\mu(\mathcal{G}) \rangle$ as a
function of time and influx rate; see Fig. 3. One observes an initial rapid construction of
increasingly complex individuals and interaction networks. In the closed system
($\Phi_{in} = 0$), both of these reach a maximum and then decline to less complex steady states
within a small subspace of possible structures. In fact, both structural complexities
effectively vanish at this extreme. The closed system specializes, ages, and eventually dies
away. At the extreme of high influx ($\Phi_{in} = 1$), when the population loses the ability to
store information, the network complexity vanishes and the individual complexity becomes that
of a purely random sample of ε-machines.

Away from these extremes, the evolution of the open systems' network complexity is maximized
at an intermediate influx rate $\Phi_{in} \approx 0.10$. Notably, the emergence of complex
organizations occurs where individual ε-machine complexity is small. Survival, however,
requires these individuals to participate in interaction networks and so to interact with a
variety of other machines; they are generalists in this sense. At higher influx
($\Phi_{in} \approx 0.75$) large $\langle C_\mu(T) \rangle$ is correlated with markedly less
complex networks. These more complex machines are specialized and do not support robust
complex interaction networks.

It turns out that the maximum network complexity $C_\mu(\mathcal{G})$ grows slowly (linearly)
with time. It is ultimately capped by the population size since there is only so much
structure that can be built with a finite number of components. More extensive investigations
show that it grows in an unbounded way with population size,
$C_\mu(\mathcal{G}) \propto \log N$, indicating the possibility of reaching highly structured
populations at large sizes.

FIG. 3: (a) Population-averaged ε-machine complexity $\langle C_\mu(T) \rangle$ [bits] and
(b) run-averaged interaction network complexity $\langle C_\mu(\mathcal{G}) \rangle$ [bits]
versus time $t$ and influx rate $\Phi_{in}$ for a population of size $N = 100$ averaged over
50 runs at each $\Phi_{in}$.

The finitary process soup demonstrates (i) that complexity of the entire system arises mainly
from the transformative relationships between individuals and (ii) that those individuals
tend to be noncomplex and to implement general rather than specialized local functions. Thus,
the population dynamics makes a trade-off: simpler individuals facilitate the emergence of
global structure. Conversely, for a system to become complex, it is not necessary to evolve
complex individuals. The results strongly suggest that replicative processes will use this
particular strategy to build successively higher levels of structural complexity from the
compositional ("metabolic") network of interacting finitary components. In this way, the
finitary process soup evolves higher computational representations; they are not built-in or
accessible at the outset.

The finitary process soup is a model of endogenous 
evolution. In particular, fitness is defined and measured 
in the biologically plausible way, as the rate of individual 
reproduction, and there is no externally imposed muta- 
tional operator. It is also a flexible model; we showed 
pan-mixia replication and will report on populations with 
spatial structure elsewhere. Moreover, it is extensible in a 
number of ways. Specifically, it is straightforward to cou- 
ple in energetic and material costs of composition. This 
will allow one to analyze trade-offs between energetics, 
dynamics, and organization. Finally, one of the key de- 
terminants of evolutionary dynamics is the structure of 
selection-neutral genotype networks; see Ref. jj] and references
therein. The neutral networks of ε-machines and
of ε-machine networks can be directly probed.

We close with a few general comments. Given that or- 
ganization in a population becomes hierarchical, we be- 
lieve that powerful computational representations, when 
employed as the basic objects, are neither effectively used 
by nor necessary for natural evolutionary processes to 
produce complex organisms. We hypothesize that indi- 
viduals with finitary computational capacity are appro- 
priate models of molecular entities and transformations. 
From these, more sophisticated organizations and func- 
tions can be hierarchically assembled. This can only be 
tested by experiment, of course, but this will soon be
possible.

It has been recently estimated that the genomes of 
higher species consist of a surprisingly small number of 
genes compared to the number found in lower species
|ld \li\ , despite the higher species being markedly more 
complex and diverse in their behaviors. Moreover, many 
of those genes serve to maintain elementary functions and 
are shared across species. These observations accord with 
the evolutionary dynamics of the finitary process soup: 
global complexity is due to the emergence of higher level 
structures and this in turn is facilitated by the discovery 
and maintenance of relatively noncomplex, but general 
objects. In both the genomic and finitary soup cases, 
one concludes that an evolving system's sophistication, 
complexity, and functional diversity derive from its hier- 
archical organization. 